Hybrid HMM/Neural Network based Speech Recognition in Loquendo ASR
نویسندگان
چکیده
This paper describes hybrid Hidden Markov Models / Artificial Neural Networks (HMM/ANN) models devoted to speech recognition, and in particular Loquendo HMM/ANN, that is the core of Loquendo ASR. While Hidden Markov Models (HMM) is a dominant approach in most state-of-the-art speaker-independent, continuous speech recognition systems (and commercial products), Artificial Neural Networks (ANN) are universally known as one the most powerful nonlinear methods for pattern recognition, time series prediction, optimization and forecasting. Hybrid HMM/ANN, introduced in the nineties for speech recognition, is presently a very competitive alternative to HMM, both in terms of performances and recognition accuracy. HMM/ANN combines the advantages of both approaches by using an ANN (a multilayer perceptron) to estimate the state dependent observation probabilities of a HMM, instead of Gaussian mixtures, while the temporal aspects of speech are dealt with by left-to-right HMM models. HMM/ANN can provide discriminative training, are capable of incorporating multiple input sources, and have a flexible architecture which can easily accommodate contextual inputs and feedbacks. Furthermore, ANN are typically highly parallel and regular structures, which makes them especially suited for high-performance architectures and optimized implementations.
منابع مشابه
شبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملMyanmar Language Speech Recognition with Hybrid Artificial Neural Network and Hidden Markov Model
There are many artificial intelligence approaches used in the development of Automatic Speech Recognition (ASR), hybrid approach is one of them. The common hybrid method in speech recognition is the combination of Artificial Neural Network (ANN) and Hidden Markov Model (HMM). The hybrid ANN/HMM is able to classify the phoneme model and to combine the strength of HMM in sequential modeling struc...
متن کاملOn recognition of non-native speech using probabilistic lexical model
Despite various advances in automatic speech recognition (ASR) technology, recognition of speech uttered by non-native speakers is still a challenging problem. In this paper, we investigate the role of different factors such as type of lexical model and choice of acoustic units in recognition of speech uttered by non-native speakers. More precisely, we investigate the influence of the probabili...
متن کاملA Initial Attempt on Task-Specific Adaptation for Deep Neural Network-based Large Vocabulary Continuous Speech Recognition
In the state-of-the-art automatic speech recognition (ASR) systems, adaption techniques are used to the mitigate performance degradation caused by the mismatch in the training and testing procedure. Although there are bunch of adaption techniques for the hidden Markov models (HMM)-GMM-based system[3], there is rare work about the adaption in the hybrid artificial neural network (ANN)/HMM-based ...
متن کاملHybrid System of Optimal Self Organizing Maps and Hidden Markov Model for Arabic Digits Recognition
Thanks to Automatic Speech Recognition (ASR), a lot of machines can nowadays emulate human being ability to understand and speak natural language. However, ASR problematic could be as interesting as it is difficult. Its difficulty is precisely due to the complexity of speech processing, which takes into consideration many aspects: acoustic, phonetic, syntactic, etc. Thus, the most commonly used...
متن کامل